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O | Abstract 

We discuss the method recently proposed by S. Chubanov for the linear feasibility 
problem. We present new, concise proofs and interpretations of some of his results. 
y—i We then show how our proofs can be used to find strongly polynomial time algorithms 

for special classes of linear feasibility problems. Under certain conditions, these results 
provide new proofs of classical results obtained by Tardos, and Vavasis and Ye. 

q 

1 Introduction 

-i— > 

In their now classical papers, Agmon [TJ and Motzkin and Schoenberg |15| introduced the 
so called relaxation method to determine the feasibility of a system of linear inequalities 
(it is well-known that optimization and the feasibility problem are polynomially equivalent 
to one another). Starting from any initial point, a sequence of points is generated. If the 
current point Z{ is feasible we stop, else there must be at least one constraint c T x < d that 
is violated. We will denote the corresponding hyperplane by (c,d). Let P( c ,d)( z i) be the 
orthogonal projection of Zi onto the hyperplane (c,d), choose a number A (usually chosen 
_^ between and 2), and dehne the new point Zi+i by Z{ + i = Z{ + X(p( c ,d)( z i) ~ z i)- Agmon, 

Motzkin and Schoenberg showed that if the original system of inequalities is feasible, then 
this procedure generates a sequence of points which converges, in the limit, to a feasible 
point. So in practice, we stop either when we find a feasible point, or when we are close 
enough, i.e., any violated constraint is violated by a very small (predetermined) amount. 
^ Many different versions of the relaxation method have been proposed, depending on how 

the step-length multiplier A is chosen and which violated hyperplane is used. For example, 
the well-known perceptron algorithms [3] can be thought of as members of this family of 
methods. In addition to linear programming feasibility, similar iterative ideas have been 
used in the solution of overdetermined system of linear equations as in the Kaczmarz's 
method where iterated projections into hyperplanes are used to generate a solution (see 

nana us]). 

The original relaxation method was shown early on to have a poor practical convergence 
to a solution (and in fact, finiteness could only be proved in some cases), thus relaxation 
methods took second place behind other techniques for years. During the height of the 
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fame of the ellipsoid method, the relaxation method was revisited with interest because 
the two algorithms share a lot in common in their structure (see [21 [91 [21] and references 
therein) with the result that one can show that the relaxation method is finite in all cases 
when using rational data, and thus can handle infeasible systems. In some special cases 
the method did give a polynomial time algorithm JT3], but in general it was shown to be 
an exponential time algorithm (see |1U|, I21|). Most recently in 2004, Betke gave a version 
that had polynomial guarantee in some cases and reported on experiments 0j. In late 2010 
Sergei Chubanov presented a variant of the relaxation algorithm, that was based on the 
divide-and-conquer paradigm. We will refer to this algorithm as the Chubanov Relaxation 
algorithm. The purpose of the Chubanov Relaxation algorithm |6j is to either find a solution 
of 



in M n , with ianraxn matrix, Canlxn matrix, b £ M m , and d € R , where the elements 
of A, b, C, and d are integers, or determine the system has no integer solutions. The 
advantage of Chubanov's algorithm is that when the inequalities take the form < x < 1, 
then the algorithm runs in strongly polynomial time. This result can then be applied to 
give a new polynomial time algorithm for linear optimization [7] . The purpose of this paper 
is to investigate these recent ideas in the theory of linear optimization, simplify some of 
his arguments, and show some consequences. 



Our Results 

We start by explaining the basic details of Chubanov's algorithm in Section[2j in particular, 
we outline the main "Divide and Conquer" subroutine from his paper. We will refer to 
this subroutine as Chubanov's D&C algorithm in the rest of the paper. Chubanov's D&C 
algorithm is the main ingredient in the Chubanov Relaxation algorithm. In the rest of 
Section [2j we prove some key lemmas about the D&C algorithm whose content can be 
summarized in the following theorem. 

Theorem 1.1. Chubanov's D&C algorithm can be used to infer one of the following state- 
ments about the system ([!]) : 

(i) A feasible solution to exists, and can be found using the output of the D&C 
algorithm. 

(ii) One of the inequalities in Cx < d is an implied equality, i.e., there exists k £ {1, ...,/} 
such that CfcX = d for all solutions to ([!]). 

(in) There exists k € {1, . . . , /} such that c^x = d for all integer solutions to ([!]). 

A constructive proof for the above theorem can be obtained using results in Chubanov's 
original paper |6J. Our contribution here is to provide a different, albeit existential, proof 
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of this theorem. We feel our proof is more geometric and simpler than Chubanov's original 
proof. We hope this will help to expose more clearly the main intuition behind Chubanov's 
D&C algorithm, which is the workhorse behind Chubanov's results. Of course, Chubanov's 
constructive proof is more powerful in that it enables him to prove the following fascinating 
theorem. 

Theorem 1.2. [see Theorem 5.1 in J($] Chubanov's Relaxation algorithm either finds a 
solution to the system 

Ax = b, , . 

< x < 1, 1 ' 

or decides that there are no integer solutions to this system. Moreover, the algorithm runs 
in strongly polynomial time. 

This is an interesting theorem and leads to a new polynomial time algorithm for Linear 
Programming [7]. However, it suffers from the drawback that it does not lead to a strongly 
polynomial time linear programming algorithm, even in the restricted setting of variables 
bounded between and 1 . Using the intuition behind our own proofs of Theorem we 
are able to demonstrate how Chubanov's D&C algorithm can be used to give a strongly 
polynomial algorithm for deciding the feasibility or infeasibility of a system like ([2]), under 
certain additional assumptions. In particular, we show the following result about bounded 
linear feasibility problems in Section [3] 

Theorem 1.3. Consider the linear program given by 

Ax = b, , . 

< x < XI. { ' 

Suppose A is a totally unimodular matrix and A is bounded by a polynomial in n, m (the 
latter happens, for instance, when A = 1). Furthermore, suppose we know that if ^ is 
feasible, it has a strictly feasible solution. Then there exists a strongly polynomial time algo- 
rithm that either finds a feasible solution of or correctly decides that the system is infea- 



If A = 1, then the running time can be upper bounded by 0(m 3 + m 2 n + n 2 m + n ). 

This theorem partially recovers E. Tardos' result on combinatorial LPs [20J. Tardos' 
results were also obtained by Vavasis and Ye using interior point methods [22], which 



is different from Tardos' approach. Our Theorem 1.3 proves a weaker version of these 
classical results using a completely different set of tools, inspired by Chubanov's ideas. 
Tardos' result is much stronger because she does not assume any upper bounds on the 
variables (A = oo), does not assume strictly feasible solutions, and only assumes that the 
entries of A are polynomially bounded by n,m. Nevertheless our result has some interest 
as the techniques are completely different from those in [20!, [22J. 
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We also show that Chubanov's D&C subroutine can be used to construct a new algo- 
rithm for solving general linear programs. This is completely different from Chubanov's 
linear programming algorithm in [7]. However, the general algorithm that we present in 
this paper is not guaranteed to run in polynomial time. On the other hand, it has the 
advantage of avoiding some complicated reformulations that Chubanov uses to extract a 
general purpose LP algorithm from the Chubanov Relaxation algorithm. Moreover, we can 
solve the problem in a single application of the D&C subroutine, whereas the Chubanov 
Relaxation algorithm needs multiple applications. 

2 Chubanov's Divide and Conquer Algorithm 

In this section we will outline Chubanov's main subroutines for the D&C algorithm as 
presented in jB]. 

First, a couple of assumptions and some notation. We assume the matrices A and C 
have no zero rows and that A is of full rank. Note if A does not have full rank we can easily 
transform the system into another, A'x = b', Cx < d, such that A 1 has full rank without 
affecting the set of feasible solutions. Let o« denote the i-th row of A and denote the 
/c-th row of C. P will denote the set of feasible solutions of Q. Finally, B(z, r) will denote 
the open ball centered at z of radius r in M. n . 

One new idea of Chubanov's algorithm is its use of new induced inequalities. Unlike 
Motzkin and Schoenberg [15j . who only projected onto the original hyperplanes that de- 
scribe the polyhedron P (see top of Figure [I]) , Chubanov constructs new valid inequalities 
along the way and projects onto them too (bottom part of Figured). 



The aim of Chubanov's D&C algorithm is to achieve the following. Given a current 
guess z, a radius r and an error bound e > 0, the algorithm will either: 




Figure 1: Chubanov's method generates new inequalities on the way. 
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(1) Find an e- approximate solution x* € B(z,r) to ([!]), i.e. some x* such that 

Ax* = b, Cx* < d + el, 

(2) Or find an induced hyperplane hx = 5 that separates B(z, r) from P. 

This task is achieved using a recursive algorithm. In the base case, Chubanov uses a 
subroutine called Chubanov's Elementary Procedure (EP), which achieves the above goal if 
r is small enough; namely, when r < — w, where ||c ma x|| = rnaxi<fc<j(||cfc||). Let P(A,b){ z ) 
denote the projection of z onto the arfine subspace defined by Ax = b. 

Algorithm 2.1. THE ELEMENTARY PROCEDURE 

Input: A system Ax = b, Cx, < d and the triple (z, r, e) where r < — n. 
Output: Either an e-approximate solution x* or a separating hyperplane hx = 5. 

If Hp^fe)^) — z\\ < r and ° k \^ k < r for all k 

Then x* = P(A.b)( z ) ls an e-approximate solution (see Figure [2| 
Else If \\p {Atb) (z) -z\\ >r 

Then let h = (z — P(A,b){ z )) T an d 5 = h ■ P(A,b)( z ) ( see Figure [3]) 

Else Cfc 1 1 2 f k ° > r for some index kn 
\\c k0 \\ - u 

Then let h = Ck and 5 = dk (see Figure |4|) 
End If 




Figure 2: The projection p(z) is an e-approximate solution. 

Note Ch n^t k tells us how far z is from the hyperplane c^x = d^, and it is in fact negative 

if z satisfies the inequality CkX < dk- Thus if Ck ^ k \\ k ^ r ' then any point in B(z,r) is an e- 
approximation of c^x < d^. This simple observation is enough to see that the EP procedure 
solves the task when r < tm~^ — it- See Chubanov |6j Section 2 for more details and proofs. 

The elementary procedure achieves the goal when r is small enough, but what happens 
when r > tjtt— - — rr? Then the D&C Algorithm makes recursive steps with smaller values of 

^||C m ax| 
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Figure 3: A separating hyperplane is given by the projection direction {z — p{z)) T . 




r. To complete these recursive steps, the D&C algorithm uses additional projections and 
separating hyperplanes. We give more details below. Figure [5] illustrates some of the steps 
in this recursion. A sample recursion tree is shown in Figure [6j 





Figure 5: The recursive step in D&C works on two smaller balls with centers z and zq 



Algorithm 2.2. THE D&C ALGORITHM 

Input: A system Ax = b, Cx < d and the triple (z, r, e). 

Output: Either an e-approximate solution x* , or a separating hyperplane hx = 5. 



If r < TTTT-^ n 

Cmax 1 1 



Then run the EP on the system 
Else recursively run the D&C Algorithm with (z, j^r, e) 
End If 



If the recursive call returns a solution x* 
Return x* 

Else let h\x = 8\ be returned by the recursive call 
End If 

Set zq = z — h ]^~^ ■ hi, i.e., project z onto (hi, Si) (see Figure [H] 
Run the D&C Algorithm with (zq, j^r, e) 

If the recursive call returns a solution x* 
Return x* 

Else let h%x = 82 be returned by the recursive call 
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End If 



If hi = —7/12 for some 7 > 

Then STOP, the algorithm fails 
Else Find a such that h = ah± + (1 — a)/i2, 5 = a5\ + (1 — 0)82, and > r 

Return hx = 5 End If 




Figure 6: D&C Recursion Tree 

We now state the running time of the Chubanov D&C Algorithm. 

Proposition 2.1. [Theorem 3.1 in JSj/] The matrix A is assumed to have m rows and n 
columns and let N be the number of non-zero entries in the matrix C . Let a = no „ 2e — no 
where c ma x is the row of C with maximum norm. Let p be a number such that \ zj\ < p and 
pzj is an integer for all components Zj of the center z in the input to the D&C algorithm. 
Let K = ( r W c ™ ax W ) i°9 2 (7/5) . 

Chubanov 's D&C algorithm performs at most 

0(m 3 + m 2 n + n 2 m + K(nlog( P + l ° 9 ^ r ± U ^ ) + n 2 + N)) (4) 

p 

arithmetic operations. 

2.1 Using the D&C Algorithm 

One possible way to exploit the D&C algorithm is the following. Since D&C returns e- 
approximate solutions, we can try to run it on the system Ax = b,Cx < d — el; if the 
algorithm returns an e-approximate solution, we will have an exact solution for our original 
system Ax = b, Cx < d. However, we need a z and an r as input for D&C. To get around 
this, we can appeal to some results from classical linear programming theory. Suppose 
one can, a priori, find a real number r* such that if the system Ax = b, Cx < d — el is 
feasible, then it has a solution with norm at most r* . In other words, there exists a solution 
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in B(0,r*), if the system is feasible. Such bounds are known in the linear programming 
literature (see Corollary 10.2b in Schrijver [IE]), where r* depends on the entries of A, b, C, d 
and e. Then one can choose z = and r = r* for the D&C algorithm. If the algorithm 
returns an e-approximate solution, we will have an exact solution for our original system 
Ax = b, Cx < d; whereas, if the algorithm returns a separating hyerplane, we know that 
Ax = b, Cx < d — el is infeasible by our choice of r. 
This strategy suffers from three problems. 

1. There is a strange outcome in the D&C procedure - when it "fails" and stops. This 
occurs when the two recursive branches return hyperplanes with normal vectors h\, hi 
with h\ = —7/12 for some 7 > 0. It is not clear what we can learn about the problem 
from this outcome. Later in this paper, we will interpret this outcome in a manner 
that is different from Chubanov's interpretation. 

2. It might happen that Ax = b,Cx < d — el is infeasible, even if the original system 
Ax = b, Cx < d is feasible. In this case the algorithm may return a separating 
hyperplane, but we cannot get any information about our original system. All we 
learn is that Ax = b, Cx < d — el is infeasible. 

3. Finally, the running time of the D&C algorithm is a polynomial in n, m and r = r* . 
Typically, the classical bounds on r* are exponential in the data. This would mean 
the running time of the algorithm is not polynomial in the input data. 

Let us concentrate on tackling the first two problems, to progress towards a correct 
linear programming algorithm. Then we can worry about the running time. This requires 
a second important idea in Chubanov's work. We address the first two problems above by 
homogenizing, or parameterizing, our original system, and we show how this helps in the 
rest of this section. Geometrically this turns the original polyhedron into an unbounded 
polyhedron, defined by 

Ax - bt = 0, 

Cx-dt< 0, (5) 
-t < -1. 

Note this system ^ is feasible if and only if ([I]) is feasible. Let (x*,t*) be a solution to 
Then |» is a solution of 0. Similarly, if x* is a solution of Q, then (x*, 1) is a solution 
of gj. Thus we can apply the D&C to a strengthened parameterized system 

Ax - bt = 0, 

Cx-dt< -el, (6) 
-t < -1 - e 

and any e-approximate solution will be an exact solution of ([5]), and thus will give us 
an exact solution of 0. We still need to figure out what z and r to use. For the rest 
of the paper, we will use e = 1 as done by Chubanov in his paper. It turns out that if 
we choose the appropriate z and r, we can get interesting information about the original 
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system Ax = b, Cx < d when D&C fails, or returns a separating hyperplane. We explain 
this next. 

Let us summarize before we proceed. Given a system ([!]) we parameterize and then 
strengthen with e = 1 (to obtain a system in the form ([6])). Then we apply the D&C to 
^ with appropriately chosen z and r. Our three possible outcomes are: 

(I) The D&C gives us a solution (x*,t*) which is an e = 1-approximate solution to 
This is the best possible outcome, because we can then return the exact solution |r- 
to 0. 



(II) The D&C fails. The reader can look ahead to our Proposition 2.4 for an interpreta- 
tion of this outcome. 



(Ill) The D&C returns a separating hyperplane hx = 8. Our Proposition 2.5 tells us how 
to interpret this outcome. 

In the rest of this section, we give some more geometric intuition behind the process of 
homogenizing the polyhedron and why it is useful. Our goal will be to prove Theorem 

2.2 Meaning of "Failure" Outcome in D&C 

First we show that if the Chubanov D&C algorithm fails on a particular system, then in 
fact that system is infeasible. This observation is never made in the original paper by 
Chubanov [6] and, as far as we know, is new. 

Proposition 2.2. Suppose Chubanov's D&C Algorithm fails on the system Ax = b,Cx < 
d, i.e., it returns two hyperplanes h\x = 8\ and hix = 82 with h\ = — 7/12 with 7 > 0. 
Then the system Ax = b, Cx < d is infeasible. 

Proof. Let P = {x 6 W 1 \ Ax = b,Cx < d}. If Chubanov's algorithm fails, then there 
exists z E M n ,r > such that the following two things happen : 

(i) h\x < 5\ is valid for P and for all y G B(z, j^r), h\y > 8\. 

(i) h2X < 62 is valid for P and for all y £ B(zq, jj^r), h2y > 82, where zq = z— ^T^ 1 - hi. 
Since z - 2jT+ff)W\ e B ^ TTe r ), h 2 • (z - 2(1 r +e) ^y ) > 8 ± . Therefore, 

h2Z0 ~ 271T0) > 52 - (7) 

Now we use the fact that h\ = —7/12 and so —-hi = hi which we substitute into ([7]) 

to get —^h\Zo — 2(1+0) > ^2- F rom the definition of zq, we have hiZo = 8\ . Therefore, 

— ^81 > 2(i+0) +^2 > ^2- So 81 < —7^2- Now hix < 81 is valid for P using (i) above. Using 
hi = — 7/12 and 81 < —7^2, we get — 7/12X < —782 is valid for P, i.e., h%x > 82 is valid for 
P. But we also know that h2X < 82 is valid for P from (ii) above. This implies that P = 
and the system Ax = b, Cx < d is infeasible. □ 
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Strengthened homogenized region 

Ax — bt = 0, Cx — dt < —1, —t < —2 is nonempty 




Ax = 6, Cic < d (non-degenerate) Ax = b, Cx < d (degenerate) 



Figure 7: This Figure illustrates our Lemma 2.3 In the left figure, the original feasible re- 
gion is non-degenerate and so the strengthened, homogenized cone is nonempty. The figure 
on the right shows an example where the original system is degenerate; the strengthened 
cone is empty because we have two parallel hyperplanes which get pushed away from each 
other, creating infeasibility in the strengthened system. 



We now interpret the "failure" outcome of the D&C algorithm on the strengthened 
parameterized system. First we prove the following useful lemma. 

Lemma 2.3. If the system Ax — bt = 0, Cx — dt < — 1, —t < —2 is infeasible, then there 
exists I G {1, . . . , k} such that c\ - x = d\ for all x satisfying Ax = b, Cx < d. 

Proof. We prove the contrapositive. So suppose that for all k G {I,..., I}, there exists 
Xk satisfying Axk = b, Cxt < d with Ck ■ Xk < dk- Then using x = j(x\ + . . . + xi), we 
get that Ax = b and c^x < d^ for all k G {1, . . . , /}. Let % = d^ — ■ x > and let 
n = mm{^,r]i, . . . , 77/}. Therefore, 77 > 0. Let x* = | and t* = -. Then 

Ax* -bt* = -(Ax-b) = 0. 
rj 

For every k G {1, . . . , I}, 

d k t* - c k x* = -(d k - c k x) = — > 1. 

rj rj 

Therefore, c k x* — d k t* < —1 for every k G {1, . . . , I}. Finally, since t = - > 2 by definition 
of 77, we have —t < —2. □ 

An illustration of the above lemma appears in Figure [7j The following is the important 
conclusion one makes if the D&C algorithm "fails" on the strengthened parameterized 
system. 
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Strengthened homogenized region Strengthened homogenized region 
is closer to the origin is farther from the origin 




Ax = b, Cx < d 



Figure 8: This Figure illustrates our Proposition |2.5| When the original feasible region is 
"thinner" , the strengthened homogenized cone is pushed farther away from the origin. 



Proposition 2.4. If Chubanov's D&C algorithm fails on the system Ax — bt = 0, Cx — dt < 
— 1,—t < —2, then there exists I G {l,...,k} such that q • x = di for all x satisfying 
Ax = b,Cx < d. 



Proof. Follows from Proposition 2.2 and Lemma 2.3 



□ 



2.3 The meaning of when a separating hyperplane is returned by D&C 

We now make the second useful observation about the strengthened parameterized system 
Ax — bt = 0, Cx — dt < —1, —t < —2. Suppose we know that all solutions to Ax = b, Cx < d 
have norm at most r* . Then we will show that if all solutions to the strengthened system 
are "too far" from the origin, then the original system is "very thin" (this intuition is 
illustrated in Figure [8]) . More precisely, we show that if all solutions to the strengthened 
parameterized system have norm greater than 2k{r* + 1), then there exists an inequality 
Qx < di such that all solutions to Ax = b, Cx < d satisfy di — | < qx. That is, the original 
polyhedron lies in a "thin strip" d\ — | < c\x < d\. This would then imply that all integer 
solutions to Ax = b,Cx < d satisfy C[X = d[ since c\,d\ have integer entries. Here is the 
precise statement of this observation. 

Proposition 2.5. Suppose r* E R is such that \\x\\ < r* for allx satisfying Ax = b, Cx < d. 
If \\(x,t)\\ > 2k(r* + 1) for all (x,t) satisfying Ax - bt = 0, Cx - dt < -1,-t < -2, then 
there exists I 6 {1, . . . , k} such that di — \ < c\x < di for all x satisfying Ax = b, Cx < d. 

Proof. Suppose to the contrary that for all j G {!,...,&}, there exists x^ such that Ax^ = 



b, Cx J < d, and CjX 3 < d? 



2 ■ 



i.e.. 



Cj {2xi) 



2dj < 



T. This implies that the following 
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equations hold for all j G {1, . . . , k} 



cj(2xi) - 2d j < -1, 

c j (2x i ) -2d j < Vi^j. w 

Now consider x = X)j=i and i = 2k. It is easily verified Ax — bi = since Ax^ = b for all 
j € {1, . . . , k}. Moreover, adding together the inequalities in Q, we get that CjX — dji < — 1 
for all j £ {1, . . . , k}. Therefore, (x, t) satisfies the constraints Ax — bt = 0, Cx — dt < 
-l,-t<-2. 

Finally, \\(x,t)\\ < \\x\\ +2k < Ylj=i 2 IMI +2k < 2k(r* + 1), where the last inequality 
follows from the fact that all solutions to Ax = b, Cx < d have norm at most r* . We have 
thus reached a contradiction with the hypothesis that ||(x,i)|| > 2k(r* + 1) for all (x,t) 
satisfying Ax - bt = 0, Cx - dt < -1, -t < -2. □ 

The above proposition shows that if Chubanov's D&C algorithm returns a separating 
hyperplane separating the feasible region of Ax — bt = 0, Cx — dt < — 1, — t < —2 from the 
ball -B(0, 2k(r* + 1)), one can infer that there exists an inequality qx < d[ that is satisfied 
at equality by all integer solutions to Ax = b,Cx < d. 

We now have all the tools to prove Theorem 

Proof of Theorem \l.l\ As discussed earlier, we can assume that there exists r* such that 
||x|| < r* for all x satisfying Ax = b,Cx < d. We then run the D&C algorithm on 
the strengthened system ^ with e = 1, z = and r = 2k(r* + 1). Recall the three 
possible outcomes of this algorithm. In the first outcome, we can find an exact solution 
of Ax = b,Cx < d - this is (i) in the statement of the theorem. In the second outcome, 



when D&C fails, Proposition 2.4 tells that we have an implied equality, which is (ii) in 



the statement of the theorem. Finally, in the third outcome, D&C returns a separating 



hyperplane. By Proposition 2.5, we know that there exists an inequality qx < d[ that is 
satisfied at equality by all integer solutions to Ax = b, Cx < d. This is (iii) in the statement 
of the theorem. □ 



2.4 Chubanov's proof of Theorem 1.2 



Chubanov is able to convert the existential results of Propositions |2.4| and |2.5| into con- 
structive procedures, wherein he can find the corresponding implied equalities in strongly 
polynomial time. He then proceeds to apply the D&C procedure iteratively and reduce the 
number of inequalities by one at every iteration. One needs at most I such iterations and 
at the end one is left with a system of equations. This system can be tested for feasibility 
in strongly polynomial time by standard linear algebraic procedures. This is the main idea 



behind the Chubanov Relaxation algorithm and the proof of Theorem 1.2 that appears 
in [61. 
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3 Linear feasibility problems with strictly feasible solutions 



We will now demonstrate that using the lemmas we proved in Section [2j we can actually 
give strongly polynomial time algorithms for certain classes of linear feasibility problems. 



More concretely, we aim to prove Theorem 1.3 Consider a linear feability problem in the 
following standard form. 

Ax = b, 

-x < 0, 1 J 

where A G W nxn and b G M. m . We will assume that the entries of A and b are integers 
and that A has full row rank. Let 

Aa = m&x{\det(B)\ \ B is an n x n submatrix of A} 

be the maximum subdeterminant of the matrix A. Let P(A, b) = {x G W 1 \ Ax = b, —x < 
0} denote the feasible region of ([9]). 

Lemma 3.1. Let x be any vertex of P(A,b). If Xi > for some i G {l,...,n}, then 
x i ^ SI- 

Proof Since x is a vertex, there exists a nonsingular n x n submatrix B of A such that x 
is the basic feasible solution corresponding to the basis B. That is, the non basic variables 
are and the basic variable values are given in the vector B~ 1 b. If for some i, x\ > then 
this is a basic variable and therefore its value is at least ^ de ^ B ^ since b is an integer vector. 
Since \det(B)\ < Aa, we have that Xi > □ 

Lemma 3.2. Suppose we know that ^ has a strictly feasible solution, i.e. there exists 
x G M n such that Ax = b and Xi > for all i G {1, . . . ,n}. // P(A, b) is bounded, then 
there exists a solution x* G W 1 such that Ax* = b and x* > for all i G {1, . . . , n}. 

Proof. Since we have a strictly feasible solution and P(A, b) is a polytope, then for every 



i G {1, . . . , n } there exists a vertex x l of P such that x\ > 0. By Lemma 3.1 we have that 



x) > -J-. Therefore, if we consider the solution 



n 

i 



X = — > X 

n 

i=i 



i.e., the convex hull of all these n vertices, then x* > for all i G {1, . . . , n}. □ 

We now consider linear feasibility problems of the form ([£]) such that either it is infeasi- 
ble or has a strictly feasible solution. We will present an algorithm to decide if such linear 
feasibility problems are feasible or infeasible. We call this algorithm the LFS Algorithm, 
as an acronym for Linear Feasibility problems with Strict solutions. As part of the input, 
we will take Aa, as well as a real number r such that P(A,b) C -6(0, r), i.e., ||x|| < r for 
all feasible solutions x. The running time of our algorithm will depend on A^r, and n. 
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Algorithm 3.1. THE LFS ALGORITHM 

Input: Ax = b, —x < 0, such that either this system is infeasible, or there exists a strictly 
feasible solution. We are also given a real number r such that P(A,b) C B(0,r), i.e., 
||x[| < r for all feasible solutions x. Moreover, we are given as part of the input. 
Output: A feasible point x* or the decision that the system is infeasible. 
Parameterize ([9]): 

Ax - bt = 0, 

-x < 0, (10) 
-t < -1. 



Run Chubanov's D&C subroutine on the strengthened version of (10): 



Ax - bt = 0, 

-x < -1, (11) 
-t < -2. 

with z = 0, f = 2nA J 4v / ^ 2 + 1, and e = 1 

If Chubanov's D&C subroutine finds an e- feasible solution (x*,t*) 

Return feasible solution for 

Else (Chubanov's D&C subroutine fails or returns a separating hyperplane) 

Return "The system Q is INFEASIBLE" 

Theorem 3.3. The LFS Algorithm correctly determines a feasible point x* of Q or 
determines the system is infeasible. The running time is 

+ m 2 n + n 2 m + (2uAaV r 2 + 1) log2 ( s) (n 2 + nlog + nlog(r)) j 




Proof. We first confirm the running time. We will use Proposition 2.1 Observe that for 
our input to the D&C algorithm, ||c max || = 1, e = 1 and f = 2n Al7r 2 + 1 and N 



n. 



i 



Therefore, K = (2nA AV / r 2 + l) log2 (s) , p = since we use the origin as the initial center 
for the D&C algorithm, \i = j^. Substituting this into Q, we get the stated running time 
for the LFS algorithm. 

To prove the correctness of the algorithm, we need to look at each of the three cases 
Chubanov's D&C can return. 



CASE 1: An e-approximate solution (x*,t*) is found for (17). Then (x*,t*) is an exact 
solution to (10), and fr is a solution to 



CASE 2: The D&C algorithm fails to complete, returning consecutive separating hyper- 



planes (h\, 5\) and (/12, £2) such that h\ = — 7/12 for some 7 > 0. Then Proposition 2.4 says 
that there exists i G {1, ... ,71} such that X{ = for all feasible solutions to ([9]). But then 
the original system ([9]) has no strictly feasible solution, and by our assumption, is therefore 
actually infeasible. Therefore if the D&C fails, our original system Ax = b, —x < is 
infeasible. 
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CASE 3: The final case is when the D&C algorithm returns a separating hyperplane {h, 5), 



separating B(0, r) from the feasible set of ( 11 ). We now show that this implies the original 
system Ax = b, —x < is infeasible. 

If not, then from our assumption, we have a strictly feasible solution x* for Q. By 



Lemma 



3.2 



> 



we know that x* 
(2nA A x\2nA A ) in R n+1 . We show that (x,t) is feasible to (fTO) 



nA, 



for all i £ {1, . . . , n}. Consider the point (x,F) 

Since Ax* = b, we 

have that Ax — bt = 0. Moreover, since x* > f° r au * ^ {1, . . . , n}, we have that 
x = 2nA A x* > 1, i.e., — x < —1. Finally, i = 2nA A > 2, since > 1 and n > 1. 
Therefore, — t < —2. Now we check the norm of this point = ||(2nA J 4X*, 2nA^)|| = 

2nAA||(x*, 1)||. Since x* € P(A,b) C 5(0, r), we know that \\x*\\ <r. Therefore, < 



f. So (x, t) is a feasible solution to ( 11 ) and (x, t) G -B(0, f). But D&C returned a separating 
hyperplane separating l?(0,f) from the feasible set of ( |11[ ). This is a contradiction. Hence, 
we conclude that Ax = b, —x < is infeasible. □ 



Corollary 3.4. Consider the following system. 

Ax = b, 
0< x < XI. 



(12) 



Suppose that we know that the above system is either infeasible, or has a strictly feasible 
solution, i.e., there exists x such that Ax = b and < x < XI. Then there exists an 



algorithm which either returns a feasible solution to (12), or correctly decides that the 
system is infeasible, with running time 



O m 3 + m 2 n + n 2 m + (2nA A X^/2n + l) loS2 (s) (n 2 + nlog A A + nlog(A)) 



Proof. We first put (12) a standard form. 



Ax = b, 
x + y = XI, 
-x < 0, 

-y<o. 



We will use the constraint matrix of ( 13 ) : 



A 



A O m xn 
In. In. 



(13) 



where mX n denotes the mxn matrix with all entires, and I n is the n x n identity 
matrix. Therefore, P(A, [b, Al]) is the feasible set for (13). Also, note that A^ = A^. 



Since < x,y < XI for all (x,y) G P(A, [b, Al]), we know that P(A, [b, 1]) C B(0,Xy/2n). 
Therefore, we run LFS with r = X\[2/n and the system (13) as input. Observe that since 



(12) has a strictly feasible solution, so does (13). By Theorem 3.3, LFS either returns a 



feasible solution to (13) which immediately gives a feasible solution to (12), or correctly 
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decides that (13) is infeasible and hence (12) is infeasible. Moreover, the running time for 
LFS is 



O y(m 3 + m 2 n + n 2 m + (2nA^Jr 2 + l) log2 (s) (n 2 + ralog A^ + ralog(r)) J 
O ( m 3 + m 2 n + n 2 m + (2nA A Xy/2n + l) log2 (s) ( n 2 + nlog A A + nlog(A)) 



□ 



Corollary 3.4 is related to the following theorem of Schrijver. Let A A = max{| det(B , 



B is a nonsingular submatrix of A}. 

Theorem 3.5 (Theorem 12.3 in [H]). A combination of the relaxation method and the 
simultaneous diophantine approximation method solves a system Ax < b of rational linear 
inequalities in time polynomially bounded by size(A,b) and by A A . 

On one hand, we have the additional assumptions of being bounded and having strictly 
feasible solutions. On the other hand, we can get rid of the dependence of the running time 
on size(A, b) and the use of the simultaneous diophantine approximation method, which 
utilizes non-trivial lattice algorith ms. It is also not immediately clear how A A is related 
to A a- We finally prove Theorem 1.3 



Proof of Theorem \1.5\ If A is totally unimodular, then A^ = 1. The result now follows 
from Corollary 3.4 □ 



4 A General Algorithm for Linear Feasibility Problems 

In this section, we describe an algorithm for solving general linear feasibility problems using 
Chubanov's D&C algorithm and the ideas developed in Section [2] We call this algorithm 
the LFG Algorithm, as an acronym for Linear Feasibility problems in General. Before 
we can state our algorithm and prove its correctness, we need a couple of other pieces. 

Lemma 4.1. [Schrijver Corollary 10.2b] Let P be a rational polyhedron in W 1 of facet 
complexity <p. Define 

Q = p n {x G R n | - 2 5n ^ < Xi < 2 5n2 * for i = 1, . . . , n}. 

Then dim(P) = dim(Q). 

Thus, if we use f = 2 5 ™ 2<?i y / n then B(0, f) will circumscribe the hypercube in the above 
lemma from Schrijver. We will also have dim(P) = dim(P n .6(0, r)). 

The second piece we need comes from Papadimitriou and Stieglitz [T7j. Simply put, the 
theorem states that ([!]) is feasible if and only if some other system is strictly feasible (i.e. 
all the inequalities are strict inequalities). The lemma stated below is a stronger version 
of their Lemma 8.7 in |17j . 



17 



Lemma 4.2. [Lemma 8.7 in fTTtf ] The system is feasible if and only if 



Ax = b, 
Cx <d + v. 



(14) 



is feasible, where v = 2 2T when T is the size of the input data. Moreover, given a solution 
to (14), we can construct a solution to in strongly polynomial time. 



Algorithm 4.1. THE LFG ALGORITHM 
Input: Ax = b, —x < 0. 

Output: A feasible point x* or the decision the system is infeasible. 
Set v = 2~ 2T where T is the bit length of the input data 
Set a new system 

Ax = b, 
Cx<d+^. 



Parameterize (15): 



Ax - bt = 0, 

Cx - (d+f)t < 0, 

-t < -1. 



(15) 



(16) 



Run Chubanov's D&C subroutine on the strengthened version of ( 16 ) 

bt 



Ax 
Cx- 
-t < 



0, 

(d+|)i< 
-2. 



■1, 



(17) 



1 



with z = 0, r = r from Lemma 3.2 applied to (17), and e 
If a feasible solution (x* ,t*) is found 

Return x from the proof of Lemma 3.3, with xq = f» 
Else Lemma 3.1 implies one of our inequalities is an implied equality 



Return "The system ( 15 ) is INFEASIBLE" 



Theorem 4.3. The LFG Algorithm correctly determines a feasible point x* of ([T]) 
determines the system is infeasible in a finite number of steps. 



or 



Proof. To prove the correctness of the algorithm, we need to look at each of the three cases 
Chubanov's D&C can return. 

CASE 1: An e-approximate solution (x*,t*) is found for (17). Then (x*,t*) is an exact 



solution to (16), and is a solution to (15), and hence is also a solution to (14). Using 



Lemma 4.2, we can construct a feasible solution to our original system ([T]). 
CASE 2: The D&C algorithm fails to complete, returning consecutive separating hyper- 
planes (hi,5i) and ^2,62) such that hi = —7/12 for some 7 > 0. By Proposition 2.4 
we know that there exists k G {1,...,/} such that c^x = di + | for all solutions to 
Ax = b, Cx < d + |. But this simply implies that Ax = b, Cx < d is infeasible. 
CASE 3: The final case is when the D&C returns a separating hyperplane (h,S). Note 
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that due to the r we use, this already implies (17) is infeasible. Then by Lemma 2.3, we 
know there exists some I such that = di + | for all x satisfying Ax = b,Cx < d + \. 
Again, as in Case 2, this implies that Ax = b, Cx < d is infeasible. □ 

Since the running time of the D&C subroutine is a polynomial in f, and f is exponential 
in the input data, our algorithm is not guaranteed to run in polynomial time. However, 
as mentioned in the Introduction, it has certain advantages over Chubanov's polynomial 
time LP algorithm from [7j. Firstly, it avoids the complicated reformulations used by 
Chubanov, which can potentially cause the actual runtime of his algorithm to be bad 
in practice. Secondly, the Chubanov Relaxation algorithm requires multiple iterations of 
the D&C subroutine, whereas the LFG algorithm uses only one application of the D&C 
subroutine. 



5 Computational Experiments 

In this final section, we investigate the computational performance of the various relax- 
ation methods mentioned in the preceding sections. We used MATLAB to implement the 
following algorithms. 

1. The Chubanov D&C algorithm, as described in Section [2] We use this algorithm on 
linear feasibility problems in the following way. We choose z = 0, e = 1 x 10~ 6 and r 
is taken as y/n + 1 for binary problems, and it is taken as the input dependent bound 



given in Lemma 4.1 This would mean that when the algorithm terminates, we either 



have an e-approximate solution, or we conclude that the system is infeasible. 

The Chubanov Relaxation algorithm, as described in [BJ. Apart from the input data 
A, b, C, d, this algorithm requires as input a real number r such that the feasible 
region is contained in B(0, r). As with the Chubanov D&C algorithm above, r is 
taken as y/n + 1 for binary problems, and it is taken as the input dependent bound 



given in Lemma 4.1 for general linear feasibility. 



3. The LFS algorithm, as described in Section [3j This algorithm also requires as input 
a real number r such that the feasible region is contained in B(0, r). As with the 
Chubanov D&C algorithm above, r is taken as y/n + 1 for binary problems, and it is 
taken as the input dependent bound given in Lemma [4. 1| for general linear feasibility 
problems. Moreover, we require as input A^, the maximum subdeterminant of the 
matrix A. We use the standard bound A^ < \a max \ n , where a max is the entry of 
A with largest absolute value. 

4. Two versions of the original relaxation algorithm developed by Agmon |1J, and 
Motzkin and Schoenberg [15] . 

Recall that the Chubanov Relaxation algorithm is not really a linear feasibility algo- 
rithm; it may sometimes report that the solution has no integer solutions (see the statement 
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of Theorem 1.2). However, we feel it is still interesting to study its practical run time, and 
compare it with our linear feasibility algorithms which are also based on the Chubanov 
D&C algorithm. 

It has already been shown that, for most purposes, the original relaxation method 
is not able to compete with other linear programming algorithms \10\ 12 lj . Thus, the 
purpose of these experiments is not to compare the running times with current commercial 
software. Rather, we want to determine how the new relaxation-type algorithms, based on 
the Chubanov D&C algorithm, compare with the original relaxation algorithm suggested 
by Agmon, and Motzkin and Schoenberg. Despite the improved theoretical performance of 
the new algorithms, the tables below show that, in practice, the original relaxation method 
is by far the preferred method in almost every case. 

As noted above, the algorithms and experiment scripts were all developed in MATLAB 
7.12.0. No parallelization was incorporated into the algorithms. The computational experi- 
ments were run on a personal computer with an Intel Core i5 M560 2.67 GHz processor. The 
problems used were drawn from the Netlib repository j8] , the MIPLIB repository [5j H3] , 
Hoffman's experiments [11], Telgen's and Goffin's example |10| I21j which shows the expo- 
nential behavior of the original relaxation method, and some randomly generated problems. 



The code and the problems used are available at http : / / www . math . ucdavis . edu/ ~mj unod/ 



In every table, a dash "— " denotes an experiment that exceeded our default time limit of 
10 minutes. For example, in Table [T] Chubanov 's algorithm timed out on the third Telgen 
experiment. Given the size and complexity of the problems involved, we determined that 
any algorithm exceeding 10 minutes had already shown how practically inefficient it was 
for that problem. When all of the algorithms timed out on a single problem, the results 
for the problem are not reported in the tables. We note here that for some experiments 
we rely on some familiarity with the problem and determine our own bound for r in the 
algorithms, to speed up the computations. All of the run times reported are in seconds. 

We note here that our LFS algorithm timed out on all instances tried. The Chubanov 
Relaxation algorithm and the Chubanov D&C algorithms timed out on all Netlib and 
MIPLIB problems, as well as on all the problems from Hoffman's experiments. Hence, the 
LFS algorithm is not reported in any table. Results for the Chubanov Relaxation algorithm 
and the Chubanov D&C algorithms on the Telgen examples are reported in Table [TJ and 
their results on the random — 1 instances are reported in Table [3} 

Table [2] compares two variants of the original relaxation method. The two versions 
of the original relaxation method differ only in how the violated constraint is chosen. See 
[H [15] for a full explanation of the algorithm. In the first implementation, called "Regular" 
in the tables, we chose the maximally violated constraint as specified by Agmon, and 
Motzkin and Schoenberg. Our second implementation, called "Random" in the tables, 
randomly chooses a violated constraint to see if there is any practical gain, as Needell's 
paper on the Kaczmarz method jTB] suggests might be possible. Every time we ran the 
"Random" version on a problem, we ran it 100 times and we are reporting the average 
number of iterations, time in seconds, and the standard deviation of each data set. In 
the experiments labeled Telgen, only two constraints exist in the problem and only one 
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Table 1: Telgen Results for the Chubanov Relaxation Algorithm and the Chubanov D&C 
Algorithm 





Chubanov Relaxation 


Chubanov D&C 


Experiment 


Recursions 


Time 

(Sec) 


Recursions 


Time 
(Sec) 


Telgen (a = 1) 


139254 


30.4761 


51 


0.0010 


Telgen (a = 2) 


2.12991 x 10 6 


451.817 


109 


0.0177 


Telgen (a = 3) 






116 


0.0211 


Telgen (a = 4) 






124 


0.0169 


Telgen (a = 5) 






132 


0.0218 



is violated at any iteration. Thus only the "Regular" version is reported as the two gave 
identical results. For every experiment we set A = 1.9 as a higher over-projection constant 
increases the speed of convergence, and let e = 1 x 10~ 6 be our error constant. 

Table[3]compares the performance of the Chubanov Relaxation algorithm, the Chubanov 
D&C algorithm and the original relaxation algorithm, on the randomly generated 0-1 prob- 
lem set. These were problems of the form Ax = b, < x < 1 with the dimension noted in 
each row. We limited ourselves to a randomly generated 0-1 matrix A with anywhere from 
1 to n — 1 rows, also randomly chosen, and populated b with integers randomly chosen 
from the set {1, . . . ,n}. For each dimension, we generated 10 random problems and then 
reported the average behavior along with the standard deviation. Note that for the D&C 
algorithm, the high standard deviations indicate that for the vast majority of the problems 
it ran quickly. 

Despite what appears to be the reasonable performance of the Chubanov D&C algo- 
rithm with the Random 0-1 problems, both the Chubanov Relaxation algorithm and the 
LFS algorithm performed much worse (in fact, the LFS algorithm timed out on all in- 
stances). Ironically, it is actually the D&C subroutine in these algorithms that causes this. 
Indeed, when we strengthen and homogenize the linear system, we greatly increase the 
parameter r that is used by the D&C subroutine in these two algorithms, creating a very 
large number of new nodes that are added to the recursion tree of the D&C algorithm. 
This creates a significant increase in the run times of these new algorithms, and as a result, 
they cannot compete practically. 
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Table 3: Test Results for Random Problems Bounded by the 0-1 Cube [0, 1] 





Chubanov Relaxation 


Chubanov D&C 


Original Relaxation 


Experiment 


Recursions 
(Avg/SD) 


Time (Sec) 
(Avg/SD) 


Recursions 
(Avg/SD) 


Time (Sec) 
(Avg/SD) 


Iterations 
(Avg/SD) 


Time (Sec) 
(Avg/SD) 


Random 2D 


61420/ 



12.309/ 
0.2771 


64.7/ 
21.679 


0.0051/ 
0.0026 


0.5/ 
0.5 


0.0004/ 
0.0003 


Random 3D 


151522/ 



30.057/ 
0.2518 


1258.3/ 
3799.59 


0.2122/ 
0.6538 


0.75/ 
0.5 


0.0005/ 
0.0002 


Random 4D 


520152/ 



102.52/ 
0.7359 


419657.3/ 
1.1198 x 10 6 


70.566/ 
187.59 


0.6667/ 
0.5774 


0.0004/ 
0.0003 


Random 5D 


1.012 x 10 6 / 



199.94/ 
1.8034 


1.3395 x 10 6 / 
1.6308 x 10 6 


247.76/ 
304.01 


0.3333/ 
0.5774 


0.0002/ 
0.0001 


Random 6D 


1.733 x 10 e / 



344.36/ 
1.0829 


774996.6/ 
1.63 x 10 6 


140.28/ 
250.41 


20/ 
26.870 


0.0015/ 
0.0016 


Random 7D 


2.6938 x 10 6 / 
68925 


544.78/ 
19.641 


772084.4/ 
1.6303 x 10 6 


120.01/ 
252.98 






Random 8D 






165.3/ 
355.41 


0.0108/ 
0.0209 


0.6667/ 
0.5774 


0.0004/ 
0.0002 


Random 9D 






309896.5/ 
979808.4 


60.004/ 
189.74 


V 



0.0004/ 



Random 10D 






32224.1/ 
101037.4 


2.0451/ 
6.3976 


239/ 



0.0127/ 
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